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POLYNUCLEOTIDE CONSTRUCT ENCODING A POLYPEPTIDE 
SUBSTANTIALLY SIMILAR TO 
HUMAN GLUTATHIONE PEROXIDASE 

Background 

1 . Technical Field 

This invention relates to recombinant DNA, more 
specifically to the cloning and expression of DNA which 
encodes human glutathione peroxidase. 

2 . Background Art 

Peroxides and oxygen derived free radicals, 
generated as a result of a variety cellular reactions, 
have been implicated in the generation of tissue damage 
involving membranes, soluble proteins, and DNA. For 
example, it is thought that a major contributor to 
reperfusion injuries associated with heart attacks, 
stroke, intestinal ischemia, frost bite, spinal cord 
trauma, and organ transplantation are oxygen-derived 
free radicals generated by the xanthine dehydrogenase 
system. Hypobaric oxygen treatment is also associated 
with ischemic type injuries. In addition, patients 
suffering from genetic or alimentary deficiencies in 
glutathione peroxidase (GPx) suffer from hemolytic 
episodes if they are exposed to drugs or xenobiotics 
generating superoxide radicals. & 2 0 2' or lii P id 
peroxides in the red cells. Lipid peroxidation is 



thought to contribute significantly to the aging of the 
brain: drugs such as chlorpromazine which ace 
extensively used in psychiatry, inhibit lipid 
peroxidation and induce the synthesis of glutathione 

peroxidase . 

Glutathione peroxidase (GPx) is a 
selenium-containing enzyme present in mammals and birds 
which catalyzes the reaction 

ROOH + 2 GSH > 2 GSSG + ROH - H 2 0 

where R may be an aliphatic or aromatic organic group, 
or hydrogen. This enzyme is of considerable importance 
because of its ability to remove a wide variety of 
hydroperoxides, but particularly lipid peroxides and 

H,0,. .. • • 

GPx. in conjunction with its cohort enzyme 

superoxide dismutase (SOD) , acts as a scavenger system 
for superoxide radicals and for peroxides. SOD converts 
a superoxide radical to H 2 0 2 . which in turn is 
converted to H 2 0 in the presence of reduced 
glutathione and glutathione peroxidase. It is of 
interest that SOD. itself, is damaged by its product. 
H O . Also. H 0 2 can form a highly reactive and 
damaging hydroxy! radical or superoxide according to the 
following reactions: 

F e +3 + H 2 O z --> Fe +2 + 0~ * 2H + 
and 



Fe +2 + H 2 0 2 --> Fe +3 + OH- + OH 
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This damage may be prevented by adequate levels of GPx. 
In sum. GPx is a key enzyme in reducing cellular damage 
which could occur from the generation of peroxides and 
superoxide radicals . 

In addition GPx serves other roles. For 
example, it modulates the formation of leukotr ienes . 
protaglandins , and thromboxanes: this modulation is the 
result of the GPx 1 s. reduction of intermediate lipid 
peroxides generated in' the arachidonic cascade. Thus, 
the modulation of GPx activity could control 
inflammatory and hypersensitivity reactions. 

It is also possible that GPx is a factor in the 
development of coronary heart disease. A deficiency in 
this enzyme could facilitate the occurrence of lesions 
in the endothelial lining of the vessels, it could 
impair the formation of the antiaggregatory 
prostacyclin, and it could lead to the premature aging 
of red cells. Gpx may also have a role in cataract 
formation in lens. Not only is there a correlation 
between age and decrease in GPx activity in human lens, 
it has also been shown that lenticular GPx activity is 
inversely proportional to severity of human cataract. 
Moreover, the onset of nuclear cataract of the lens is 
accompanied by a precipitous loss of GPx activity. 

GPx has been isolated from different mammalian 
sources, including bovine erythrocytes, ovine 
erythrocytes, rat liver, human erythrocytes, and human 
placenta. Cf Flohe. in FREE RADICALS IN BIOLOGY. VOL V. 
(Pryor. W.A. . ed.. Academic Press. N.Y.) pages 223-227. 

The enzyme isolated from bovine erythrocytes is 
the most thoroughly studied, and serves as a prototype 
for structural considerations. GPx is a soluble 
tetrameric protein of molecular weight about 85.000. It 
contains four identical subunits of approximately 21.000 



daltons. An unusual component of the enzyme is 
selenocysteine. one residue of which occurs in each 
subunit. as revealed by X-ray diffraction analysis and 
complete sequencing of bovine enzyme. W.A. Gunzler et 
al (1984). Hoppe-Seyler :s Z. Physiol. Chem 364. 195. 
Studies with rat liver slices suggest that selenium is 
incorporated into the enzyme via a translational 
pathway, w.c. HawkeP *nfl a.l. Tappel (1983). Biochim. 

Biophys. Acta 739. 225. 

Human GPx could be used therapeutically to 
overcome the effects of genetic deficiencies in the 
enzyme. It also has potential therapeutic use in 
overcoming the effects caused by ischemic trauma, and as 
an'ad junct to hypobaric oxygen treatment. In addition, 
it could find use in the mental disorders related to 
aging, or which are currently treated with 
chlorpromazine. It also may be useful for the 
treatment, both prophylactic and therapeutic, of 
cataracts. However, a limitation in the source of this 
enzyme, i.e. human tissue, is to be expected. 

Activators and inhibitors of GPx activity may 
have use in modulating the inflammatory and 
hypersensitivity reactions which result from products 
produced by the arachidonic acid cascade. To 
efficiently design specific modulators, it is necessary 
to isolate human GPx in sufficient quantity so that it 
can be structurally characterized and its mechanisms of 
action understood. 

V 

Disclosure of the Inve ntion 

One embodiment of the invention is a 
polynucleotide construct encoding a polypeptide 
substantially similar to human GPx. The polynucleotide 



construct contains a heterologous region, which contains 
a coding sequence for the polypeptide. Polypeptides 
substantially similar to human GPx include: human GPx; 
fragments of human GPx; analogs of human GPx. and 
analogs of fragments of human GPx. 

Another embodiment of the invention is a 
composition comprising cDNA encoding human GPx. 

Still another embodiment of the invention is a 
method of producing a recombinant polypeptide 
substantially similar to human GPx . The method for 
producing the polypeptide consists of several steps 
which include the following. First, providing a 
population of transformed cells containing a vector, in 
-which the vector consists of: (1) a coding sequence for 
a polypeptide substantially similar to human GPx; and 
(2) sequences which allow the expression of the coding 
sequence in the cells, wherein these sequences consist 
of at least a promoter which controls transcription of 
the coding sequence, a ribosomal binding sequence, a 
translation termination sequence, and a transcription 
termination sequence. Second, growing the population of 
transformed cells under conditions whereby the 
polypeptide substantially similar to human GPx is 
expressed. Finally, recovering the expressed 
polypeptide. 

Yet another embodiment of the invention is a 
composition containing a polypeptide synthesized by 
expr-es-sion of the above-mentioned polynucleotide 
construct . 

The invention also consists of a composition 
which contains a polypeptide synthesized by expression 
of the polynucleotide construct which encodes a 
polypeptide substantially similar to human GPx. 
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ttrief pe^riptior nrawinqs 

Figure 1 shows the synthetic oligonucleotides 
used as probes in the isolation of cDNA encoding bovine 
GPX. 

Figure 2 shows a comparison of bovine, mouse. 

•and human GPx cDNA sequences. 

Figure 3 shows the homology of bovine, mouse, 
and human preprocessed GPx putative translation products. 

wn^s for r*rrvina O "^ Tnvention 

The practice of the present invention will 
employ, unless otherwise indicated, conventional 
techniques of molecular biology, microbiology, and 
recombinant DNA. which are within the sKill of the art. 
Such techniques are explained fully in the literature, 
see e.g.. Maniatis. Fritsch &- Sambroo*. MOLECULAR 
CLONING: A LABORATORY MANUAL (1982): DNA CLONING. 
Volumes I and II (B.N. Glover ed. 1985): OLIGONUCLEOTIDE 
SYNTHESIS (M.J. Gait ed. 1984): NUCLEIC ACID 
HYBRIDIZATION (B.D. Hames & S.J. Higgins edS. 1984): 
TRANSCRIPTION AND TRANSLATION (B.D. Hames & S.J. Higgins 
eds 1984): ANIMAL CELL CULTURE (R.I. Freshney ed . 
1986)- IMMOBILIZED CELLS AND ENZYMES (IRL Press. 1986): 
B Perbal. A PRACTICAL GUIDE TO MOLECULAR CLONING 
(1984): and the series. METHODS IN ENZYMOLOGY (S. 
ColowicK and N. Kaplan eds. Academic Press. Inc.). 

In describing the present invention, the 
following terminology will be used in accordance with 
the definitions set out below. 

A "nucleotide construct" refers to the 
polymeric form of nucleotides, either ribonucleotides or 
deoxyribonucleotides. This term refers only to the 
primary structure of the molecule.. Thus, this term 
includes double- and single-stranded DNA as well as 
double- and single-stranded RNA. 
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A "replicon" is any genetic element (e.g., a 
plasmid. a chromosome, a virus) that behaves as an 
autonomous unit of polynucleotide replication within a 
cell; i.e. capable of replication under its own control. 

A "vector" is a replicon in which another 
polynucleotide segment is attached, so as to bring about 
the replication and/or expression of the attached 
segment . 

A "coding sequence" is a polynucleotide 
sequence which is transcribed and/or translated into a 
polypeptide when placed under the control of appropriate 
regulatory sequences. The boundaries of the coding 
sequence are. determined -by . a translation start codon at 
the 5' -terminus and a translation stop codon at the 
3 '-terminus. A coding sequence can include, but is not 
limited to. prokaryotic sequences. cDNA f rom. eukaryotic 
mRNA. genomic DNA sequences from eukaryotic DNA. and 
even synthetic DNA sequences. A polyadenylation signal 
and transcription termination sequence will usually be 
located 3* to the coding sequence. 

A "promoter sequence" is a DNA regulatory 
region capable of binding RNA* polymerase and initiating 
transcription of a downstream (i.e. in the 3' direction) 
coding sequence. In the present invention, the promoter 
sequence is bounded at its 3 1 -terminus by the 
translation start codon of a coding sequence and extends 
upstream (i.e. in the 5 1 direction) to include the 
minimum number of bases or elements necessary to 
initiate transcription at^ levels detectable above 
background. Within the promoter sequence will be found 
a transcription initiation site (conveniently determined 
by mapping with nuclease SI) . as well as protein binding 
domains (consensus sequences) responsible for the 
binding of RNA polymerase. 
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A coding sequence is "under the control 11 of the 
promoter sequence in a cell when transcription of the 
coding sequence results from the binding of RNA 
polymerase to the promoter sequence; translation of the 
resulting mRNA then results in the polypeptide encoded 
within the coding sequence. 

"Transformation" is the insertion of an 
exogenous polynucleotide into a host cell. The 
exogenous polynucleotide may be maintained as a plasmid. 
or alternatively, may be integrated within the host 
genome. In cases where the exogenous polynucleotide is 
RNA, the RNA or a portion thereof will be reverse 
transcribed into DNA prior to integration. 

Two DNA sequences are "substantially 
homologous" when at least about 90%. and preferably at 
least about 95%. of the nucleotides match over the 
defined length of the DNA sequences. Sequences that are 
substantially homologous can be identified by 
hybridization experiments, wherein the hybridization is 
carried out under stringent conditions as defined for 
that particular system. See. e.g.. Maniatis et al. , 
supra : DNA CLONING. Vols. I & II. supra : NUCLEIC ACID 
HYBRIDIZATION, supra . Detection of non-hybridized 
regions may be. for example: by SI nuclease degradation 
of non-hybridized segments followed by size analysis by 
gel electrophoresis: or by heteroduplex analysis by 
electron microscopy. 

A "polypeptide which is substantially similar 
to human GPx" includes human GPx. analogs of human GPx 
and analogs of human GPx fragments. An "analog of human 
GPx" or of a "fragment of human GPx" is one in which the 
homology to human GPx is greater than about 90%, and 
preferably greater than about 95%. Analogs include 
e.g.. polypeptides which lac* the selenium moiety, or 



ones in which the se,lenium moiety is joined to an amino 
acid other than cysteine, as well as molecules which 
have amino acid substitutions. Analogs of human GPx and 
of human GPx fragments give rise to antibodies which 
react preferentially with human GPx when compared to GPx 
derived from other species. Preferential reactivity of 
antibodies is assayed by competition immunoassay, which 
is well known to those skilled in the art. A 
"polypeptide which is substantially similar to human 
GPx" also includes fragments of human GPx. both active 
and inactive. An "active fragment" of human GPx is that 
portion of the polypeptide which will, with the 
appropriate cofactors. catalyze the reaction normally 
catalyzed by the native~enzyme . An inactive fragment of 
human GPx is a peptide or polypeptide which has the 
sequence of a portion of the human GPx molecule. An 
inactive fragment of human GPx is recognized by its 
preferential reactivity with an antibody to human GPx. 

A "clone" is a population of cells derived from 
a single cell. A "cell line" is a clone of a primary 
cell that is capable of stable growth in vitro for many 
generations. 

A "heterologous region" of a polynucleotide 
construct is an identifiable segment of polynucleotide 
within the larger polynucleotide molecule that is not 
found in association with the larger molecule in 
nature. Thus, when the heterologous region encodes a 
mammalian gene, the heterologous region will be flanked 
by a nucleotide sequence that does not flank it in the 
genome of the source organism. Another example of a 
heterologous coding sequence is a construct where the 
coding sequence itself is not found in nature (e.g.. a 
cDNA where the genomic coding sequence contains introns, 
or a synthetic sequence having one or more codons 
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different than the native gene). Allelic variations or 
naturally occurring mutational events do not give rise 
to a heterologous region of DNA. 

A nucleotide construct encoding human GPx may- 
be used in methods to produce the enzyme by recombinant 
methods. It may- also be used in gene therapy to 
alleviate genetic disorders producing enzyme deficiency. 

DNA . encoding human GPx was isolated by a method 
which relied, in part, on nucleic acid hybridization. 
Since data on both the nucleotide sequence and the amino 
acid sequence of the human enzyme was lacking, a novel 
method was derived which relied on the known amino acid 
sequence -of -the bovine enzyme... 

In this method. DNA probes were designed so as 
to have at least moderately high homology with two 
regions of the bovine enzyme. Degeneracies in the code 
were taken into consideration and the probes designed 
around them, as discussed in the Experimental section, 
infra. These DNA probes were then used to isolate by 
nucleic acid hybridization and cloning techniques, a 
full length cDNA encoding bovine GPx. Using the cDNA 
encoding bovine GPx as a probe. cDNAs encoding human GPx 
were isolated from a \-cDNA library carrying EcoRI 
recombinant inserts derived from human kidney. The 
cDNAs encoding human GPx were subcloned. and sequenced. 
The sequence for human GPx. as derived from these 
clones, is presented in Fig. 2. 

An outstanding feature of the .DNA sequence for 
human GPx is the presence^of a single opal translation 
stop codon (TGA) in frame within the coding region. 
This feature is also present in the DNA sequences for 
bovine and mouse GPx. 

The amino acid sequence for human GPx was 
deduced from the nucleotide sequence encoding it. It 
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may be deduced from the amino acid sequence (shown in 
Fig. 3) that human GPx probably does not enter the 
secretory pathway and that the polypeptide is not 
N-glycosylated . 

Although a method for preparing a DNA construct 
encoding human GPx based upon hybridization to a bovine 
cDNA probe has been provided, in the current invention 
the preparation of such constructs is not limited to 
this method. Utilizing the sequence information 
provided herein, other methods of preparing 
polynucleotide constructs encoding GPx may be devised. 
For example, the nucleotide sequence encoding human GPx 
may be synthesized utilizing automated DNA synthesis. 
See. -e.gT.. Edge (1981) Nature 2 92 . 756; Nambair et al 
(1984) Science 223 . 1299; Jay et al (1984) J. Biol. 
Chem. 259 . 6311. Alternatively, oligonucleotides 
containing a portion of the sequence information may be 
synthesized; these may then be used as probes to screen 
human genomic DNA libraries and cDNA libraries. The 
basic strategies for preparing oligonucleotide probes 
and DNA libraries, as well as their screening by nucleic 
acid hybridization, are well known to those of ordinary 
skill in the art. See. e.g.. DNA CLONING: VOL. I (D.P. 
Glover ed 1985); NUCLEIC ACID HYBRIDIZATION (B.D Hames S, 
S.J. Higgins eds. 1985); OLIGONUCLEOTIDE SYNTHESIS (M.J. 
Gate ed. 1984); T. Maniatis et al.. MOLECULAR CLONING: A 
LABORATORY MANUAL (1982); B. Perbal. A PRACTICAL GUIDE 
TO MOLECULAR CLONING (1984). 

Once a sequence encoding human GPx has been 
prepared or isolated, it can be cloned into any suitable 
replicon to create a vector, and thereby be maintained 
in a composition which is substantially free of vectors 
that do not contain the human GPx gene (e.g.. other 
clones derived from the library). Numerous cloning 
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vectors are Known. to those of skill in the art. and the 
selection of an appropriate cloning vector is a matter 
of choice. Examples of recombinant DNA vectors for 
cloning and host cells which they can transform include 
the bacteriophage \ (E. coli). pBR 3 22 (E. coli). 
pACYC 177 (E. coli). pKT 230 (gram-negative bacteria). 
PGV1106 (gram-negative bacteria). pLAFRl (gram-negative 
bacteria). pME290 (non-E. coli gram-negative bacteria). 
pHV 14 (E . coli and Bacillus subtilis). pBD9 (Bacillus). 
pU61 (Streptomyces). pUC6 (Streptomyces) . actinophage 
<i>C31 (Streptomyces. YIp5 (Saccharomyces . YCpl9 
(Saccharomyces. and bovine papilloma virus (mammalian 
cells).. generally . DNA CLONING: VOLS. I & II. 

sup_ra; T. Maniatis et al.. supra; B. Perbal. sufira. 

The polynucleotide sequence encoding human GPx 
is expressed by inserting the sequence into an 
appropriate replicon thereby creating an expression 
vector, and introducing the resulting expression vector 
into a compatible host. 

in creating an expression vector the human GPx 
sequence is located in the vector with the appropriate 
control sequences, which include a promoter, a ribosomal 
binding site, and transcriptional and transnational stop 
codons. The positioning and orientation of the coding 
sequence with respect to the control sequences is such 
that the coding sequence is transcribed under the 
-control" of the control sequences: i.e.. the promoter 
will control xhm transcription of the -mRNA derived from 
the coding sequence: and x the ribosomes will bind at the 
ribosomal binding site to begin the translational 
process: and the stop codon used to terminate 
translation will be upstream from the transcriptional 
termination codon. 
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In addition to control sequences, it may be 
desirable to add regulatory sequences which allow for 
regulation of the expression of the human GPx gene 
relative to the growth of the host cell. Examples of 
regulatory systems are those which cause the expression 
of a gene to be turned on or off in response to a 
chemical or physical stimulus, including the presence of 
a regulatory compound. In profcaryotic systems these 
would include the lac and trp operator systems. Other 
examples of regulatory sequences are those which allow 
for gene amplification. In eukaryotic systems, these 
include the dihydrof olate reductase gene which is 
amplified in the presence of methotrexate, and the 
methallothionein genes, which are amplified with heavy 
metals. In these cases, the sequence encoding human GPx 
would be placed in tandem with the regulatory element. 

Other types of regulatory elements may also be 
present in the vector, i.e. those which are not 
necessarily in tandem with the sequence encoding human 
GPx. An example is the SV 4Q enhancer sequence, which, 
by its mere presence, causes an enhancement of 
expression of genes distal to it. 

Modification of the sequence encoding human 
GPx, prior to its insertion into the replicon, may be 
desirable or necessary, depending upon the expression 
system chosen. For example, in some cases, it may be 
necessary to modify the sequence so that it may be 
attached to the control sequences with the appropriate 
orientation, i.e. to maintain the reading frame. In 
some cases, it may be desirable to add sequences which 
cause the secretion of the polypeptide from the host 
organism, with subsequent cleavage of the secretory 
signal. In addition, in some cases, it. may be desirable 
to alter the opal stop codon (reasons for this 
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alteration are discussed below). The techniques for 
modifying nucleotide sequences utilizing cloning ace 
well Known to those skilled in the art. They include, 
e.g.. the use of restriction enzymes, of enzymes such as 
Bal31 to remove excess nucleotides, and of chemically 
synthesized oligonucleotides for use as adapters, to 
replace lost nucleotides, and in site directed 
mutagenesis. See. e.g.. Maniatis. Fritsch & Sambrook.. 
MOLECULAR CLONING: A LABORATORY MANUAL (19 82): DNA 
CLONING. Volumes I and II (D.N. Glover, et al ed. 1985): 
NUCLEIC ACID HYBRIDIZATION (B.D. Hames & S.J. Higgins 
eds. 1984). 

Modification off. the sequence encoding human GPx 
is "also necessary if polypeptides substantially similar 
to human GPx are to be synthesized. These polypeptides 
differ in some engineered way from the native enzyme. 
E.g.. if a fragment of human GPx is the desired product, 
the sequence encoding the enzyme would be modified to 
remove the undesired sequences corresponding to the 
amino acids which are to be deleted. If an active 
fragment of human GPx is the desired product, the 
deleted sequences most likely would be in the regions of 
the amino, and/or the carboxy- terminus. However, the 
region encoding the selenocysteine moiety would be 
retained, as would other regions encoding portions of 
the enzyme which are necessary for activity. 

It may also be of interest to synthesize 
analogs of human GPx and analogs of fragments of human 
GPx. Such analogs may vary in their substrate 
specificity. E.g.. in pig heart there are apparently 
two forms of glutathione peroxidase, one of which 
catalyzes the reduction of H 2 0 2 and lipid peroxides, 
and another which catalyzes the reduction of 
phospholipid peroxides and intact peroxidized 
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membranes. Despite the difference in substrate 
specificity, the mechanism of action of both forms of 
enzyme appears to be the same, and the amino acid 
compositions are similar. Nevertheless, there is 
evidence suggesting that there may be variations in the 
active site of the two analogous enzymes. F. Ursini et 
al. (1985). Biochim . Biophvs. Acta 839 . 62. Thus, it 
may be desirable to modify the sequence encoding the 
active site of human GPx so that the product polypeptide 
will preferentially reduce one type of substrate, e.g.. 
phospholipid peroxides which are implicated in membrane 
damage . 

It may also be of interest to synthesize 
analogs or" fragments of human GPx which differ in their 
hydrophobicity. allowing greater or lesser interactions 
with membranes, or with liposomes. This may be 
accomplished by substituting hydrophobic amino acids for 
hydrophilic amino acids in some of the external domains 
of the polypeptide or vice versa. Such changes in 
hydrophobicity are accomplished by modifying the 
sequences encoding the specific amino acids which are to 
be substituted. 

Polypeptides which are substantially similar to 
human GPx or to its fragments but which lac* the active 
site may also be synthesized. In this case'~the sequence 
encoding the enzyme would be modified so that those 
codons encoding the amino acids of the active site would 
be deleted. . 

The above are examples of the way the human GPx 
polypeptide can be modified by modification of the 
sequence encoding human GPx. These examples are not 
meant to be exhaustive, and one skilled in the art can 
readily determine other modifications which would be 
useful. All of these modifications may be accomplished 
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using the above cited techniques and references 
concerning the modification of nucleotide sequences. 

The appropriately modified sequence encoding a 
polypeptide substantially similar to human GPx. 
including human GPx. may be ligated to the control 
sequences prior to insertion into a vector. 
Alternatively, the coding sequence can be cloned 
directly into an expression vector which already 
contains the control sequences and an appropriate 
restriction site. For expression of the human GPx-type 
polypeptide in prokaryotes and in yeast, the control 
sequences will necessarily be heterologous to the coding 
sequence. In addition, for expression in prokaryotes. 
the coding sequence will be free of introns. In cases 
where human GPx is to be expressed in cell lines derived 
from vertebrates, the coding sequence can be with or 
without introns. and the control sequences may be either 
heterologous or homologous. 

A number of replicons which may be used to 
construct prokaryotic expression vectors are Known in 
the art. See. e.g.. U.S. Patent Nos. 4.440.859; 
4 436.815: 4.431740. 4.431739; 4.428941: 4.425.437: 
4*418149: 4.422.994: 4.366. 246: 4.342. 832. Replicons 
which may be used to construct yeast expression vectors 
are also known in the art. See. e.g.. U.S. Patent Nos. 
4 446.235: 4.443.539; 4.430.428. An example of a 
replicon which can be used to construct an expression 
vector for mammalian host cells is described in commonly 
owned patent application no. 921.730. the disclosure of 
which is incorporated herein. 

The choice of an expression system is 
dependent, in part, upon whether or not selenocysteine 
is to be incorporated into the polypeptide during 
translation. If the modified amino acid residue is to 
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be incorporated translationally , a host system which 
contains the system to carry out such translational 
incorporation is required. Examples of such systems 
include a variety of immortalized mammalian cells, 
including mouse cells, monkey cells, and cell lines 
derived from hamster cells (3T3. Vero, Chinese Hamster 
Ovary cells). Primary cell lines derived from, 
vertebrates may also be used. It should be noted that 
in most cases these cell lines will contain directly the 
system for translating the opal stop codon, so that 
modification of the sequence encoding human GPx need not 
be altered to accommodate the translation. 

An example of a prokaryotic expression host 
which can directly incorporate a selenocys teine during 
translation is E. coli. In these organisms 
selenocysteine is translationally incorporated into 
formate dehydrogenase. F . Zinoni et al. (1986), Proc . 
Natl. A cad Sci. USA 83 . 4650. Moreover, the 
selenocysteine residue is incorporated in response to 
the opal nonsense codon (TGA) . 

In some cases, the host may contain the system 
for the translational incorporation of selenocysteine. 
but the triplet codon signaling the incorporation may 
not be the opal codon. If this- type of host is used, 
the sequence encoding human GPx will have to be modified 
to replace the opal codon with the appropriate signal 
codon. The techniques for modification of the 
polynucleotide sequence have been discussed supra. in 
other cases, a host may be selected which is able to 
modify a pre-existing tRNA to Se-Cys tRNA. 

If the selenocysteine residue is not to be 
incorporated translationally into the polypeptide, other 
expression hosts may be used. However, * in using these 
systems the sequence encoding the opal stop codon may 
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have to be modified to allow translation to progress, 
in this case, the modification will encode for either 
cysteine or another amino acid(s) in place of 
selenocysteine . 

Alternatively, the host will contain a 
suppressor system which allows translation of the opal 
stop codon. A variety of suppressor systems are Known 
in the art. Cf . D.A. Steege and D.G. Soli (1979) in 
BIOLOGICAL REGULATION AND DEVELOPMENT OF GENE EXPRESSION 
(RF. Goldberger. ed.) Vol 1. pp. 433-475 (bacterial 
suppressor systems). F. Sherman (1981) in THE MOLECULAR 
BIOLOGY OF THE YEAST SACCHAROMYCES : LIFE CYCLE AND 
INHERITANCE (J.N.. Strathern. E.W. Jones and J.R. Broach, 
eds.) pp. 463-486 (yeast suppressor systems), and D. 
Hatfield. (1985) *^ Ql - Sci - 201-204 

(suppressor systems in higher eukaryotes). These 
suppressor systems may be naturally occurring in the 
host, or may be incorporated into the host via vectors, 
either naturally occurring or engineered. 

Selenium may be incorporated into human GPx 
post-translationally. One method for post-translational 
incorporation would be by synthesizing the polypeptide 
fragments which flank the selenocysteine molecule, and 
then joining in the proper orientation, the two 
polypeptides to the selenocysteine via peptide 
linkages. Another method would entail the incorporation 
of the selenium moiety into the appropriate site of a 
polypeptide analog of human GPx. which does not contain 

the selenium moiety. 

Depending on the expression system and host 
selected, a polypeptide which is substantially similar 
to human GPx , including human GPx. or an analog of human 
GPx or a fragment of human GPx. is produced by growing 
host cells transformed by an expression vector described 
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above under conditions whereby the polypeptide is 
expressed. The synthesized polypeptide is then isolated 
from the host cells and purified. If the expression 
system secretes the enzyme into growth media, the 
protein can be purified directly from the media. If the 
recombinant polypeptide is not secreted, it is isolated 
from cell lysates. The selection of the appropriate 
growth conditions and recovery methods are within the 
skill of the art. 

Isolation of the newly synthesized polypeptide 
depends upon an assay system by which the polypeptide 
may be detected. These assay systems would be obvious 
to one skilled in the art. For example, if the newly 
synthesized polypeptide exhibits glutathione peroxidase 
activity, the polypeptide can be detected by assaying 
for that enzymic activity. Cf K. Takahashi et al. 
< I986 > J - Clin. Invest. 77. 1402. for an assay for human 
GPx activity. 

It is also possible to detect the newly 
synthesized polypeptide by immunoassay using antibodies 
to polypeptides substantially similar to human GPx. 
including human GPx. In this case, the type of antibody 
used in the assay will reflect the expected presence or 
absence of specific known epitopes. E.g.. if the newly 
synthesized polypeptide is an analog of human GPx or a 
fragment of human GPx which lacks the seleno moiety, the 
preparations of antibodies used for detection would 
contain those directed towards parts of the molecule 
other than the epitope containing selenocysteine . On 
the other hand, if the newly synthesized polypeptide is 
human GPx including substantially similar polypeptides, 
or an active fragment thereof, all of which may be 
expected to contain the selenocysteine laoiety. the 
preparations of antibodies used for immunoassay would 
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contain those directed to the epitope containing the 
selenocysteine moiety. The techniques of immunoassay 
are well known to those skilled in the art., and a 
method of preparing polyclonal antibodies to human GPx 
has been described. See K. Takahashi. ,7. Clin. Invest ., 
supra . 

in general, recombinant production of human GPx 
can provide compositions of that enzyme substantially 
free of contaminating proteins, i.e. of at least 95% 
purity. The ability to obtain high levels of purity is 
a result of recombinant expression systems which can 
produce human GPx in substantial quantities vis-a-vis in 
vivo sources. Thus, by applying conventional techniques 
to recombinant cultures, human GPx compositions of 
substantial purity and amount are obtainable. 

It should be noted, that with the sequence data 
of the present invention, production of human GPx is not 
restricted to recombinant methods. It may also be 
synthesized by chemical methods such as solid phase 
peptide synthesis. Such methods are known to those 

skilled in the art. 

The synthetic polypeptides which are 
substantially similar to human GPx. including human GPx. 
can be used to produce antibodies, both polyclonal and 
monoclonal. If polyclonal antibodies are desired, a 
selected mammal (e.g.. mouse, rabbit, goat, horse, etc.) 
is immunized with purified human GPx polypeptide or 
fragment thereof, or analog thereof, or fragment of an 
analog thereof. Serum from the immunized animal is 
collected and treated according to known procedures. If 
serum containing polyclonal antibodies to human GPx 
contains antibodies to other antigens, the human GPx 
polyclonal antibodies can be purified by immunoaf f inity 
chromatography. See. e.g.. K. Takahashi. J • Clin. 
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Invest.. supra. . foe a method of preparing polyclonal 
antibodies to human GPx. 

Monoclonal antibodies to human GPx can also be 
readily produced by one skilled in the art. The general 
methodology for making monoclonal antibodies by 
hybridomas is well known. Immortal antibody-producing 
cell lines can be created by cell fusion, and also by 
other techniques such as direct transformation of B 
lymphocytes with oncogenic DNA. or transection with 
Epstein-Barr virus. See. e.g.. M. Schreier et al.. 
HYBRIDOMA TECHNIQUES (1980); Hammerling et al.. 
MONOCLONAL ANTIBODIES AND T-CELL HYBRIDOMAS (1981): 
Kennett et al.. MONOCLONAL ANTIBODIES (1980); see also 
U.S. Patent Nos. 4.341.761; 4.399.121: 4.427.783: 
4.444.887; 4.452.570; 4.466.917; 4.472.500. 4.491.632: 
and 4.493.890. Panels of monoclonal antibodies produced 
against human GPx can be screened for various 
properties: i.e.. for isotype. epitope, affinity, etc. 
Monoclonal antibodies directed against specific epitopes 
are useful in defining interactions of human GPx with 
its various substrates. in addition, monoclonal 
antibodies are useful in purification, using 
immunoaff inity techniques, of native or recombinantly 
produced human GPx. 

In the preparation of antibodies, prior to 
immunization, it may be desirable to increase the 
immunogenicity of human GPx. or particularly of its 
fragments. This can be accomplished in any one of 
several ways known to those of skill in the art. For 
example, the antigenic peptide or polypeptide may be 
administered linked to a carrier. For example, a 
fragment may be conjugated with a macromolecular 
carrier. Suitable carriers are typically large, slowly 
metabolized macromolecules such as: proteins: 
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polysaccharides. such as latex f unctionalized sepharose. 
agarose, cellulose, cellulose beads and the like: 
polymeric amino acids, such as polyglutamic acid, 
polylysine. and the like: amino acid copolymers: and 
inactive virus particles.. Especially useful protein 
substrates are serum albumins. Keyhole limpet 
hemocyanin. immunoglobulin molecules, thyroglobulin. 
ovalbumin, and other proteins well known to those 

skilled in the art. 

The protein substrates may be used in their 
native form or their functional group content may be 
modified by. for example, succinylation of lysine 
residues or reaction with Cys-thiolactone . A sulfhydryl 
group may also be incorporated into the carrier (or 
antigen) by. for example, reaction of amino functions 
with 2-iminothiolane or the N-hydroxysuccinimide ester 
of 3-(4-dithiopyridyl) propionate. Suitable carriers 
nxay also be modified to incorporate spacer arms (such as 
hexamethylene diamine or other bifunctional molecules of 
similar size) for attachment of peptides. 

If human GPx or polypeptides substantially 
similar to human GPx are to be used therapeutically, it 
ma y be desirable to link the polypeptide molecule to an 
efficient system to deliver the GPx to the appropriate 
site, and which will also protect the polypeptide from 
proteolysis, and at the same time cause a controlled 
delivery of the polypeptide. Systems for the delivery 
of molecules are known to those of skill in the art. and 
are reviewed, for e.g.. in Poznansky. M.J.. et al. m 
DRUG DELIVERY SYSTEMS (R.L. Juliano. ed. Oxford. N.Y.. 
1980). pp. 253-315. and inPoznansky. M.L. (1984). Pharm 

Revs 36. . 277. 

An example of a delivery system which may be 
utilized for a polypeptide substantially similar to 




human GPx is a liposome delivery system. Liposome 
delivery systems have been proposed for a variety of 
therapeutic products, particularly those which are 
administered parenterally . Liposomes have the potential 
of providing a controlled "depot" release of the 
administered GPx polypeptide over an extended time 
period, and of protecting the administered polypeptide 
from protease degradation. Liposomes can also alter the 
tissue distribution and uptake of the administered 
polypeptide. In addition, targeting macromolecules can 
be added to the liposome composition. 

Several methods for preparing liposomes with 
entrapped compounds are known, and may be used for 
preparations of liposomes containing a polypeptide 
substantially similar to human GPx . For example, in one 
method, vesicle forming lipids are deposited as a thin 
film on the sides of a flask, and slowly rehydrated by 
addition of an aqueous buffer. The compound to be 
entrapped may be included either in the lipid film or in 
the aqueous hydration medium, depending upon the 
hydrophobicity and stability of the compound. The 
liposomes that form are multilamellar vesicles (MLVs) 
having heterogeneous sizes between about 0.05 and 10 
microns. 

The MLVs may be subsequently processed, 
typically by homogenization. sonication, or membrane 
extrusion, to produce smaller particles and a more 
uniformly sized suspension. Liposome sizing down to 
about 0.2-0.4 microns is ^generally preferred. Liposomes 
in this size range can be sterilized by passage through 
a 0.45 micron depth filter, have less tendency to 
aggregate, and also may show more favorable organ 
distribution when administered intravenously. Gabizon. 
A., et al (1982). Cancer Research 42. 4734. 
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Alternative methods foe preparing liposomes are 
also known. E.g.. they may be prepared by solvent 
injection, in which a lipid-in-solvent solution is 
injected into an aqueous medium, as described by. for 
example. Cafiso. D.S. (1981). Biochim. Biophys . Acta 
649.. 129. They can also be prepared in a reverse 
evaporation phase method, as described by SzoKa. F. Jr.. 
et al. (1980). Ann. Rev. Biophys. Eng. 9. 467. More 
specifically, a lipid-in solvent solution is mixed with 
an aqueous medium, and emulsified to form a water-m-oil 
emulsion. Removal of the lipid solvent produces a 
reverse-phase lipid gel which is then agitated, 
preferably in the presence of added aqueous medium, to 
form reverse-phase evaporation vesicles characterized by 
relatively large sizes and one to a few bilayer shells. 

Since encapsulation is usually relatively poor 
with hydrophilic substances, it may be desirable to 
remove the free polypeptide from the composition 
containing the liposome encapsulated polypeptide. This 
can be achieved by treating the liposomes by known 
methods, for example, by molecular sieve chromatography, 
by centrifugation. or by diaf iltration. 

The purified human GPx compositions of the 
present invention are useful in several regards. First, 
they can be used to overcome genetic deficiencies in the 
patient's enzyme, particularly in individuals who would 
suffer from hemolytic episodes as a result of exposure 
to drugs or xenobiotics which generate superoxide 
radicals. HO or lipid peroxides in red cells. 
They also have potential therapeutic use in overcoming 
the effects caused by ischemic trauma, as an adjunct to 
hypobaric oxygen treatment, as a treatment for 
cataracts, and in treatment of mental disorders related 
to aging, or which are currently treated with 
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chlorpromazine . Human GPx may also have therapeutic use 
in preventing the development of coronary heart disease, 
which is related to its affect on prostacyclin 
synthesis, as well as the deleterious effects of the 
deficiency leading to premature red cell aging and 
damage to the endothelial lining of blood vessels. 

Human GPx will also be useful as a tool in 
designing activators and inhibitors of the enzyme. 
These activators and inhibitors have several uses. 
E.g.,. they would be useful in delineating the mechanism 
by which human GPx acts with its several types of 
substrates, eg. lipid peroxides, peroxidized DNA, 

H 2°2 ' and the intecmed iates in the arachidonic 
cascade. In addition, therapeutically these enzyme 
modulators could be useful in modulating the 
inflammatory and hypersensitivity reactions which result 
from products produced by the arachidonic acid cascade. 

The sequence encoding human GPx may also be 
inserted into vectors which are useful for human gene 
therapy, and used as a treatment for individuals who are 
genetically deficient in the enzyme. These vectors 
carry the information which allow the insertion of the 
sequence into the treated individuals genome. The 
vectors may be either of the DNA type, e.g. SV 
derivatives, or of the RNA type, e.g. the RNA 
retroviruses. Methods of inserting sequences into these 
oncoviruses are known to those skilled in the art. 

Recombinant polypeptides substantially similar 
to human GPx. including human GPx and fragments thereof, 
are also useful for the synthesis of antibodies to these 
polypeptides: the methods for production of the 
antibodies has been discussed above. 

The antibodies to polypeptides* substantially 
similar to human GPx, including human GPx, would have. 
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n>any uses; e.g.. such antibodies, both polyclonal and 
monoclonal, would find use in the purification of human 
GPx by immunoaf f inity chromatography. 

The antibodies would also be useful foe the 
diagnosis of glutathione peroxidase deficiency. 
Moreover, since selenium deficiency causes a reduction 
in human GPx levels (K. Takahashi et al.. J. Clin. 
,n^'f.. supra ..) the antibodies would also be useful 
for diagnosing selenium deficiency in humans. 

in addition, the human GPx fragments and analog 
polypeptides as well as monoclonal antibodies to 
specific epitopes on glutathione peroxidase will be 
useful in-determining how the enzyme interacts with its 
various substrates, as well as how the enzyme may be 
targeted for hydrophilic and hydrophobic sites. This 
Knowledge will be useful in designing analogs of human 
GPx for specific therapeutic uses. E.g.. analogs may be 
designed which will affect the arachidonic cascade by 
virtue of limiting activity with substrates. Other 
types of analogs may be designed which are highly 
suitable for preventing membrane damage which is 
associated with lipid peroxidation. 

Described below are examples of the present 
invention which are provided only for illustrative 
purposes, and not to limit the scope of the present 
invention. In light of the present disclosure, numerous 
embodiments within the scope of the claims will be 
apparent to those of ordinary sKill in the art. 

Experimental 

Sequences encoding human GPx DNA. derived from 
cDNA. were cloned according to the following procedure. 
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1 . Materials and Methods 

Enzymes were purchased from commercial sources. 

and used according to the manufacturers' directions. 

Radionucleotides and nitrocellulose filters were also 

purchased from commercial sources. 

Plasmids were propagated in E. coli strain 
HB101. Lambda (\) bacteriophage gtlO was propagated 

in E. coli strain C600AHFL. 

DNA was sequenced by the dideoxynucleotide 
chain termination method in bacteriophage ml3 . Sanger 
et al. (1977). Proc. Natl. Acad. Sci. USA 74. 5463. To 
overcome problems with band compression, which are 
sometimes observed in GC rich regions. 7-deazoguanosine 
was used. Barr. P.. et al (1986) Biotechniques 4. 
428-432 . 

DNA was synthesized by automated oligomer 
synthesis on a glass support using an automated DNA 
synthesizer (B.W. Warner et al. (1984) DNA 3. 401-411). 
and nucleoside N.N-diisopropyl phosphoramidites 
(L. McBride and M. Caruthers (1983) Tetrahedron Lett. 
24. 245. And S.P. Adams, et al. (1983) J. Am. Chem. 
Soc. 105. 661-663). 

In the cloning of DNA fragments, all DNA 
manipulations were done according to standard 
procedures. See, Maniatis et al., MOLECULAR CLONING. 
supra . Restriction enzymes. T 4 DNA ligase. DNA 
polymerase I. and other biological reagents were 
purchased from commercial suppliers and used according 
to the manufacturers' directions. Double-stranded DNA 
fragments were separated on agarose gels and isolated by 
electroelution. 

cDNA libraries were prepared by standard 
techniques in \gtlO. See. T.V. Huynh et al., in DNA 
CLONING: VOL/1. (Glover. Ed.), supra . Double stranded 
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cDNA was prepared essentially as described. U. Gublet 
and B.J. Hoffman (1982). supra. After methylation of 
internal EcoRI sites and the addition of EcoRI linkers, 
the cDNA was ligated into \gtlO. T.V. Huynh et al. 
(1985) in DNA CLONING (D.M. Glover, ed) pp 49-78. The 
phage were packaged and recombinants selected by plating 

on E. coli strain BNN102. 

Nucleotide homologies between the cloned 
sequences encoding human, bovine, and mouse GPx were 
deduced using a computer alignment program. This 
program was also utilized to test for the presence of 
significant homology of these cloned sequences with 
sequences in the data base "organelles" wherein the 
sequences of the entire mitochondrial bovine and genome 
reside. Homologies of the proteins deduced from the 
cloned DNA sequences were deduced using a computer 
alignment program. 

2. noninq of ^NA Seque nce Encoding Bovine GPx 

A cDNA library derived from bovine pituitary 
was prepared. Recombinants carrying the gene encoding 
bovine GPx were selected by probing with two specially 
designed synthetic oligomers shown in Fig. 1. 

These oligomers were designed from the Known 
amino acid sequence of bovine GPx. W. A. Gunzler et al. 
(1984). See prev. so as to anneal to regions of the 
coding strand which possess low degeneracy in their 
code, namely regions coding amino acids 108 through 125 
as numbered in Figure 3 (for the 53-mer). and 81 through 
95 (for the 44-mer). Where degeneracies were 
encountered, the following prioritized principles were 
exercised in selecting a nucleotide: (1) consistency 
with the codon usage bias observed in bovine DNA was 
maintained; (2) in cases where potential mismatches 
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might arise. GT pairs rather than other mismatches were 
favored; and (3) the sequence 5'-CG-3', seldom observed 
in the coding strand of eukaryotes, was also presumed to 
occur infrequently, and was thus avoided. 

To select the recombinants carrying the bovine 
GPx sequences, the culture plates were replica plated in 
duplicate onto nitrocellulose filters. The filters were 
probed by hybridization with the oligomers which had 
been terminally labeled with 32 P, followed by washing 
in 4XSSC. 0.1% SDS at 50°C. for 60 min. The conditions 
for hybridization and for the terminal labelling of the 
probe are given in T. Maniatis et al.. Molecular Cloning 
Supra. Approximately 20 plaques gave strong signals; 10 
of these were isolated and plaque purifed by replating 
on E. coli strain BNN102 and sequencing of the DNA 
insert . 

The Eco RI insert DNA encoding bovine GPx from 
the 10 clones was subcloned into pBR322. using standard 
techniques. For DNA sequencing, the inserts were 
reexcised from the pBR322 with EcoRI. The sequence of 
the largest insert, 830 bp, is presented in Fig. 2. 

2 • Cloning of cDNA Sequences Encoding Human GPx 

A cDNA library derived from human kidney was 
prepared. Recombinants carrying sequences encoding 
human GPx were selected by using a probe derived from 
the 830 bp sequence encoding bovine GPx. 

More specifically, the 830 bp fragment encoding 
bovine GPx was excised from pBR322 with Eco RI. and 
isolated by gel electrophoresis. The fragment was 
radiolabeled by nick translation. See T. Maniatis et 
al., MOLECULAR CLONING, supra . Recombinants were 
selected by replica plating and lifting. DNA onto 
nitrocellulose in duplicate, followed by probing the 
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filters by hybridization with the r adiolabelled probe, 
and washing the filters in 0.1XSSC. 0.1% SDS at 25»C for 
60 mtn. To isolate clones, phage from 10 plaques were 
plaque purified, and the EcoRI insert DNA was 
sequenced. Three clones encoding fragments of human GPx 
were obtained: a clone containing a 550 bp insert 
ChGPxl) corresponding to the 3- end of the GPx mRNA; a 
clone containing a 280 bp insert (hGPx2) corresponding 
to the 5' end of the GPx mRNA: and a clone containing a 
560 bp insert (hGPx3). The 560 bp clone was found to 
correspond to a portion of 3' end of the GPx mRNA and 
also contained a 278 bp intron. . 

The EcoRI insert DNAs encoding human GPx from 
these clones was subcloned into the EcoRI site of 
PBR322. using standard techniques. For DNA sequencing, 
the inserts were reexcised from the pBR322 with EcoRI. 

4. nontna of r.TMA sequ ences Encoding Mouse GPx 

Utilizing a cDNA library prepared from mouse 
placenta, clones carrying sequences encoding mouse GPx 
were isolated by the procedures described for the 
isolation of clones carrying sequences encoding human 
GPx. described above. Using these procedures, two 
clones were isolated; one clone contained a fragment of 
"about 550 bp. and the other clone contained a fragment 
of about 300 bp. These inserts were subcloned into 
PBR322. and their DNA inserts sequenced after excision 
with EcoR I . 

5. Homologies rhe cPN * inserts Encoding Human, 

Mouse, and Bovine GPx . 

The nucleotide sequence encoding human GPx is 
shown in Fig. 2. which compares the human sequence with 
the bovine sequence and a mouse sequence. The cDNAs 
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used in deriving the sequence were those described 
above, i.e., the two human sequences which did not 
contain the intron, the two mouse sequences, and the 
full length bovine sequence. The sequences were derived 
from sequencing both strands of the cDNAs . However, the 
sequencing Was not done across the internal Eco Rl sites 
of the human and mouse DNA. Nevertheless, the homology 
with bovine sequences across these restriction sites and 
their flanking regions is high and contiguous, 
indicating that the mouse and human fragments are 
properly aligned. 

In Fig. 2. spaces indicated by hyphens have 
been introduced to maximize homology between the DNAs . 
Positions of identity are indicated by asterisks. The 
features of the sequences are highlighted as follows. 
Enclosed within boxes are the presumed trans lational 
start codon. ATG; the opal codon within the structural 
gene. TGA; and the codon used to stop translation. 
Putative polyadenylat ion signals are double-underlined. 
The. location of the intron in the human and the mouse 
sequences is shown as a vertical broken line and marked 
IVS. Nonover lapping inverted repeats greater than 6bp 
which are separated by less than 30 bp are shown by 
arrows. 

A comparison of the human, mouse, and bovine 
cDNA sequences shows the following. The GC contents of 
the human, mouse, and bovine cDNA are 62%. 62%. and 59%. 
respectively. in those codons where wobble with A or T 
is permitted in the thirds position. G or C is favored in 
that position 81%. 73%. and 82%. for human, mouse and 
bovine, respectively. Within the coding region, the 
nucleic acid homology for all three species is 
approximately 83%. although bovine carries a 15 bp 
insert (Fig. 2). The homologies within the other 
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regions reflect more variation. Within the 
3 '-untranslated region the homology is 71% 
(bovine/human). 69% (mouse/human), and 67% 
(bovine/mouse). In the 5 ■ -untranslated region, homology 
between bovine and mouse is approximately 35%. The 
human and mouse cDNAs possess a canonical 
polyadenylation signal ( AATAAA) at positions 799 and 
814. respectively. It is presumed that: the bovine cDNA 
clone did not reveal this feature because of its 
relative shortness. 

6. Localization of the genes encoding GPx 

The cellular location of the gene coding for 
GPx is of interest because of the unusual coding event 
which occurs. TGA-SeCys . Since GPx has two cellular 
locations, i.e. mitochondrial and extramitochondr ial . it 
is important to determine whether the genes encoding the 
cDNAs isolated are located in the mitochondrion or in 
the nucleus. The cellular location of the genes was 
established by a computer search of the total bovine and 
human mitochondrial genome libraries using sequences 
homologous to the isolated bovine and human GPx cDNAs 
The lack of appreciable homology of the isolated cDNAs 
to the mitochondrial libraries indicates that the genes 
encoding these GPxs are of nuclear origin. 

7. m.iveis of rhP Induce * *™i no Acia Sequences of 

Human. Mouse, an * bovine GPxs 

The amino acid sequence for human GPx. as 
deduced from the nucleotide sequence for cDNA encoding 
the polypeptide is shown in Fig. 3. which also shows the 
homologies of the mouse and bovine sequences to that of 
the human. In the Figure, maximum homology is achieved 
by including spaces, which are indicated by dashes. The 
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symbol # indicates infected selenocys te ine res idues 
(discussed below). The terminal amino acids of the 
mature bovine protein as deduced by chemical methods (W. 
A. Gunzler. et al. (1984) ( see page 3 ^ ) and by X-ray 
diffraction analysis (O. Epp. et al. (1983) Eur J. 
Biochem. 133 , 51-69) are shown by closed and open 
triangles, respectively. Potential glycosylat ion sites 
are indicated by underlined amino acids. The homologous 
location of the mouse and human introns is shown by an 
arrow (IVS). Amino acids which are proposed to bind 
glutathione or play a functional role during catalysis 
are shown by circles. The sequences are shown in 
single-letter code and correspond to the three letter 
code as follows: A, Ala; C. Cys ; D. Asp; E. Glu; F. Phe; 
G. Gly; H. His; I. He; K. Lys ; L. Leu; M, Met; N. Asn; 
P, Pro; Q t Gin; R, Arg; S. Ser; T, Thr ; V. Val; w. Trp; 
and Y. Tyr . 

The opal stop codon. TGA. within the cDNA 
sequence encoding human GPx encodes the unusual amino 
acid residue, selenocysteine . This is inferred from a 
comparison of the bovine cDNA sequence with the derived 
amino acid sequence (Figs. 2 and 3) and the published 
amino acid sequence (W. A. Gunzler, et al.. (1984) (see - 
page 3)). and the homologies between the human and 
bovine sequences. A similar conclusion has been reached 
for mouse GPx. I. Chambers et al (1986). EMBO J . 5. 
1221. 

Based upon the deduced sequences, preprocessed 
human GPx contains 201 amino acids. In comparison, 
preprocessed bovine and mouse GPx contain 205 and 201 
amino acids, respectively (Fig. 3). Approximately 87% 
of all human residues are homologous with the bovine 
residues, although bovine GPx possesses. a 5 amino acid 
insert. Homology between human and mouse residues is 
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85%. An additional 7 to 10%. not exhibiting identity 
between pairs of species, arise from conservative 
replacements in the genetic code. It has been proposed 
that three bovine residues. Arg-57. Met-147 and Arg 184. 
bind glutathione during catalysis and that Trp-165 is 
involved catalytically. These residues are apparently 
conserved in both human arid mouse GPx. 

Certain structural features of human GPx may 
also be inferred from the cDNA sequences and the deduced 
amino acid sequences. First, nucleotide sequencing of 
one of the human GPx cDNA EcoRI inserts showed the 
pcesence of a 278 bp intron which exists between the 
nucleotides coding for Gln-82 and Glu-83 (Figs. 2 and 
3). This "position is homologous to a 216 bp intron in 
mouse. ™*™hgr 6 et al (1986). EMBO J ., sufira. A 
proposed function of introns is to separate two protein 
domains. In the case of human GPx. the intron would 
separate the a^helix of the polypeptide from the 
a -helix and the remainder of the polypeptide. 

2 The analysis of the deduced amino acid sequence 

also shows features which are relevant to processing of 
the nascent human GPx. First, the N-terminus of the 
nascent polypeptide lacks potential dibasic processing 
signals, suggesting a lack of a specific cleavage 
event. In addition, the classical features of a 
potential N-terminal secretory signal-sequence with its 
accompanying cleavage site are also lacking. Thus, it 
would appear that human GPx does not enter the secretory 
pathway. Finally there is a lack of potential 
N-glycosylation sites (Asn-X-Ser/Thr ) . suggesting that 
the polypeptide is not N-glycosylated . 

Analysis of the deduced amino acid sequences of 
bovine and mouse GPxs shows that the above also pertains 
to those enzymes, with one exception. Both the bovine 
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•■"««• -loo acid seguences contain potentia! 
N- 9 lycoeylation sites. Houever _ b 

«• highly homologous. and because HI H « n 

of the locates of the potential glycosylate sites 

7' m " k6d m ".ic Positions, 

tha the .o„ se ana bovine enzy.es also are not' 
N-glycosylated. 

HBlol(phGPxl). and HBl<u<p h GPx2) have been 

(MCCJ. 12301 PatKlavn Dr.. EoCcville. Maryland 20*52 
and have been assigned Accession Nos. 

and . ■ .* 

the I so bp .„ " " Sfectlvel y- HBlOKphGPxl) contains 
hbio, —aing. a portion of human GP*. 

HB10l,ph O p x2) contains the 200 bp insert encoding a 
Port.on of human GPx. These deDoslts „. ^.J/ 
convenxence only, and are not reared to practic Z 
Present mention in view of the description here n 
The human SPx DNA sequences in ^ »• 

are incorporated herein by reference. 

While the present invention has been 
mustrated above by certain specific embodiments it is 

s op T I" Chat CheSe SPeClIlC »-it 
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1. A polynucleotide construct encoding a 
polypeptide substantially similar to human glutathione 
peroxidase (GPx) . said polynucleotide construct 
comprising a heterologous region, said heterologous 
region comprising a coding sequence for said polypeptide 
substantially similar to human GPx. wherein the 
polypeptide substantially similar to human GPx encoded 
within the heterologous region is selected from a group 
consisting of human GPx. a fragment of human GPx. an 
analog of human GPx. and an analog of a fragment of 
human GPx.: 

2. The polynucleotide construct of claim 1. 
wherein the polynucleotide is double stranded DNA. 

3. The DNA construct of claim 2 wherein the 
coding sequence is for human GPx. 

4. The DNA construct of claim 3 wherein the 
coding sequence is that shown in Figure 2. 

5. The polynucleotide construct of claim 1. 
wherein the construct is further comprised of sequences 
necessary for replication of the construct. 

6. The polynucleotide construct of claim 5. 
wherein the construct is further comprised of sequences 
allowing for expression of the coding sequence, said 
sequences comprising: 

a promoter sequence: 

a ribosomal binding sequence: 
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a sequence which causes the cessation of 
translation; and 

a sequence which causes the cessation of 
tr anscr ipt ion. 



7. The polynucleotide construct of. claim 6, 
wherein the sequences allowing expression of the coding 
sequence allow expression in eufcaryotic cells. 

8. The polynucleotide construct of claim 7, 
wherein the sequences allowing expression of the coding 
sequence allow expression in mammalian cells. 

9". The polynucleotide construct of claim 7 , 

wherein the sequences allowing expression of the coding 
sequence allow expression in yeast cells. 

10. The polynucleotide construct of claim 6. 
wherein the sequences allowing expression of the coding 
sequence allow expression in prokaryotic cells. 

11. An expression host harboring the 
polynucleotide construct of claim 6. 

12. A composition comprising cDNA encoding 
human GPx. 



polypeptide subs tantially v similar to human GPx, wherein 
polypeptides substantially similar to human GPx is 
selected from a group consisting of human GPx. a 
fragment of human GPx, an analog of human GPx , and an 
analog of a fragment of human GPx, the method comprising 
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A method of producing a recombinant 
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a. providing a population of transf ocmed cells 
containing a vector, said vector comprising 

(1) a coding sequence for said polypeptide 
substantially similar to human GPx, and 

(2) sequences allowing expression of said 
coding sequence in said cells, said sequences 
allowing expression comprising a promoter which 
controls transcription of said coding sequence, 
a ribosomal binding sequence, a translation 
termination sequence, and a transcription 
termination sequence. 

b. growing said population of transformed 
cells under conditions whereby said polypeptide 
substantially similar to human GPx is expressed; and 

c. recovering said polypeptide substantially 
similar to human GPx* 

14. The method of claim 13 wherein said cells 
are proKaryotic. 

15. The method of claim 13 wherein said cells 
are eukaryotic . 

16. The method of claim 15 wherein the 
eukaryotic cells are yeast cells. 

17. The method of claim 15 wherein the 
eukaryotic cells are mammalian cells. 



18. A composition comprising a polypeptide 
synthesized by expression of the polynucleotide 
construct of claim 1. 
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19. A composition according to claim 18, 
further comprising a raacromolecular structure with which 
the polypeptide is associated, wherein said 
raacromolecular structure is a tissue specific targeting 
agent for the polypeptide. 



20. A composition according to claim 18 , 
further comprising a structure in which the polypeptide 
is encapsulated. 

21. A composition according to claim 20, 
wherein the structure is a liposome. 

22. A composition according to claim 20, 
further comprising a macromolecular structure with which 
the polypeptide is associated, wherein said 
macromolecular structure is a tissue specific targeting 
agent for said polypeptide. 
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S ■ GCCTTCTCCCCATTCACCTCACACTTCTCAAACAGCAT 
AAAGTTAGGCTCAAA 3 • 
( 53-mer ) 



ACGTTGGTCAAACCGGTGGTCCTTTTGCGGTTTTTGCTTCTTTA 5 

(44-raer) 



Figure 1 
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